Benign vs Malignant Tumors (National Cancer Institute, 2001)
Major Differences: Circularity, nucleation, rigidity, size?
Hypotheses
The null hypothesis: benign tumors and malignant tumors have cells of the same average area.
The alternative hypothesis: malignant tumors have larger average cell areas.
The Variables of Interest and Test Statistic
The variables - diagnosis (whether the tumor is malignant or benign), - designated by an ‘M’ or a ‘B’ in the diagnosis column - mean tumor cell area - a calculated value for each sample in the area_mean column.
The test statistic is the difference in means between area in the benign and malignant tumor samples.
# A tibble: 2 × 2
diagnosis ave_area
<chr> <dbl>
1 B 463.
2 M 978.
So, it looks like the mean area of malignant tumor cells is larger than that of benign tumor cells. However, is that generalizable to other breast tumors? Off to the permutation test!
The observed difference in mean cell size between malignant and benign breast tumors was 515.479.
This difference did not occur once in 1,000 random permutations.
The extremely small p-value provides very strong evidence against the null hypothesis.
Therefore, the results suggest that all malignant breast cancer cells have larger average sizes than benign breast cancer cells.
Implications
This means that average cell size could potentially serve as a potential quantitative metric for the rapid and automated classification of tumor malignancy.
References
“Normal and Cancer Cells Structure: Image Details.” NCI Visuals Online, National Cancer Institute, (2001). visualsonline.cancer.gov/details.cfm?imageid=2512.
Street, W.N., Wolberg, W.H., & Mangasarian, O.L. “Nuclear feature extraction for breast tumor diagnosis.” (1993) Proc. SPIE 1905: Biomedical Image Processing and Biomedical Visualization. https://doi.org/10.1117/12.148698
Wolberg, W., Mangasarian, O., Street, N., & Street, W. “Breast Cancer Wisconsin (Diagnostic)” (1993) UCI Machine Learning Repository. https://doi.org/10.24432/C5DW2B